# Image embedding extraction
Vit Small Patch14 Reg4 Dinov2.lvd142m
Apache-2.0
A visual Transformer (ViT) image feature model with registers, pre-trained using the self-supervised DINOv2 method on the LVD-142M dataset.
Image Classification
Transformers

V
timm
15.98k
5
Vit Large Patch16 224 In21k
Apache-2.0
A Vision Transformer model pretrained on the ImageNet-21k dataset, suitable for image feature extraction and downstream task fine-tuning.
Image Classification
V
google
92.63k
26
Featured Recommended AI Models